WHAT IS CLAIMED IS: 

1 C^ v ^\ 1 • A computer-implemented method of displaying a document using a 

2 wowser, the method compnsing: 

3 accessing the document; 

4 searching the document to identify text patterns in the document which are 

5 relevant to a plurality of concepts; 

6 marking locations of the text patterns in the document; and 

7 displaying the document using the browser such that the text patterns in 

8 the document which are relevant to the plurality of concepts are annotated. 

1 2. The method of claim 1 wherein the browser is an Internet Explorer 

2 browser and the searching and marking use information about the document stored in a 

3 Document Object Model configured by the Internet Explorer browser. 

1 3. The method of claim 1 wherein searching the document comprises: 

2 determinin^^t(^atterns associated with the plurality of concepts; and 

3 searching the v document to identify text patterns in the document which 

4 match the text patterns associa^d with the plurality of concepts. 

1 4. The method of claim 1 wherein: 

2 the browser is an Internet Explorer browser; and 

3 the searching and marking is performed using methods provided by an 

4 IHTMLTxtRange interface. \ 

1 5. The method of claim 1 wherein: 

2 the browser is an Internet Explorer browser; and 

3 the searching and marking i\ performed using methods provided by an 

4 IMarkupServices interface. \ 

1 6. The method of claim 1 wherein marking the locations of the text 

2 patterns in the document comprises inserting annotation tags in front of and after each 

3 text pattern identified in the document to be relevant to a concept from the plurality of 

4 concepts, wherein the annotation tags for each tex\ pattern identify the concept to which 

5 the text pattern is relevant. \ 
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7. The method of claim 6 wherein displaying the document 
comprises: \ 

for each text pattern identified as being relevant to a concept from the 
plurality of concepts: \ 

determining the concept to which the text pattern is relevant from 
\ 

the annotation tags surrounding the^text pattern; 

determining style information to be used for annotating the text 

\ 

pattern, wherein the style information's associated with the concept to which the text 
pattern is relevant; and 

annotating the text pattern based on the style information. 

8. The method of claim 7 wherein annotating the text pattern 
comprises highlighting the text pattern in a color indicated by the style information. 

9. The method of claim 1 further comprising: 

calculating a score for each concept in the plurality of concepts, the score 
indicating relevance of the document to the concept; and 

Vc 

\ 



displaying a relevance indicator for each concept in the plurality of 



concepts based on the score for the concept. 

10. The method of claim 9 wherein calculating the score for each 
concept in the plurality of concepts comprises: ^ 

for each concept in the plurality of concepts: 

determining frequency of text patterns in the document identified 
as being relevant to the concept; and \ 

calculating the score based on the frequency of the text patterns. 

11. A computer- implemented method of displaying a multi-page 
document using a browser, the method comprising: 

accessing a multi-page document; 

displaying a section of the multi-page document in a first viewing area of a 
display; \ 

determining information about the contents of the multi-page document; 

displaying a thumbnail image in a second viewing area of the display 
based on the contents information determined from the multi-page document, the 
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thumbnail image displaying the contents of the multi-page document in a continuous 
form; and 

emphasizing an area of\the thumbnail image corresponding to the section 

of the multi-page document displayed in the first viewing area. 

\ 

12. The method of claim 1 1 wherein: 

\ 

determining information about the contents of the multi-page document 
comprises: \^ 

determining dimension information for the contents; and 
determining coordinate information for the contents; and 
displaying the thumbnail image comprises: 

displaying the contents in the thumbnail image based on the 
dimension and coordinate information for the contents. 

13. The method of claim 12 wherein displaying the contents in the 
thumbnail image based on the dimension and coordinate information for the contents 
comprises: 

for each content: 

determining position of the content in the thumbnail image by 
dividing the coordinate and dimension information for the content by a reduction ratio. 

14. The method of claim 1 1 wherein determining information about the 
contents of the multi-page document comprises: ^ 

determining information about text entities contained in the multi-page 

document; 

determining dimension and coordinate informatibn for the text entities; 
determining if the text entities are relevant to a plurality of concepts; and 
if a text entity is relevant to a concept in the plurality of concepts: 

associating the text entity with style information for the concept, 

the style information indicating a manner of annotating text entities yi the multi-page 

document which are relevant to the concept. 
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1 5 . The methibd of claim 1 4 wherein displaying the thumbnail image 

comprises: 

if a text entity is relevant to a concept in the plurality of concepts: 

displaying thk text entity in the thumbnail image using the style 
information for the concept. t 

Y 

16. The method of claim 15 further comprising: 
modifying the style information for a concept; 

in response to the modification: 

identifying text entities in the multi-page document which are 
relevant to the concept; and ^ 

dynamically changing the display of the identified text entities in 
the thumbnail image based on the modified style information. 

17. The method of claim 1 1 wherein determining information about the 
contents of the multi-page document comprises: \ 

determining information about forms contained in the multi-page 
document; and ^ 

determining dimension and coordinateanformation for the forms. 

18. The method of claim 1 1 wherein determining information about the 
contents of the multi-page document comprises: \^ 

determining information about image elements contained in the multi-page 

document; and 



\ 



determining dimension and coordinate information for the image elements. 

19. A computer- implemented method of displacing a multi-page 
document using a browser, the method comprising: 
accessing the multi-page document; 

searching the multi-page document to identify text patterns in the 
document which are relevant to a plurality of concepts; 

marking locations of the text patterns in the document; 

displaying a section of the multi-page document in a first ^iewing area of a 
display such that the text patterns in the multi-page document which are r^Jevant to a 
plurality of concepts are annotated; 
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1 0 determining information about the contents of the multi-page document; 

1 1 displaying a thumbnail image in a second viewing area of the display 

12 based on the contents information determined from the multi-page document, the 

1 3 thumbnail image displaying the contents of the multi-page document in a continuous 

14 form; and \ 

15 emphasizing an area of the thumbnail image corresponding to the section 

16 of the multi-page documen^displayed in the first viewing area. 

1 20. A system for displaying a document using a browser, the system 

2 comprising: . 

3 a processor; and \ 

4 a memory coupleci to the processor and configured to store a plurality of 

5 modules for execution by the processor, the plurality of modules module including: 

6 a module for accessing the document; 

7 a module forsearching the document to identify text patterns in the 

8 document which are relevant to a plurality of concepts; 

9 a module for marking locations of the text patterns in the 

10 document; and 

11 a module for displ^ing the document using the browser such that 

12 the text patterns in the document which ar^ relevant to the plurality of concepts are 

1 3 annotated. 

1 21 . The system of claim 2& wherein the browser is an Internet Explorer 

2 browser and the module for searching and the module for marking use information about 

3 the document stored in a Document Object Mo<\el configured by the Internet Explorer 

4 browser. 

1 22. The system of claim 20 wherein the module for searching the 

2 document further comprises: ^- ^ 




3 a module for determming^text patterns associated with the plurality of 

4 concepts; and \^ 

5 a module for searching the document to identify text patterns in the 

6 document which match the text patterns^associated with the plurality of concepts. 
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1 23. The system of claim 20 wherein: 

2 the browser is an Internet Explorer browser; and 

3 the modules for searching and marking use methods provided by an 

4 IHTMLTxtRange interface. 

1 24. The systein of claim 20 wherein: 

2 the browser is an Ijnternet Explorer browser; and 

3 the modules for searching and marking use methods provided by an 

4 IMarkupServices interface. 

1 25. The system ok claim 20 wherein the module for marking the 

2 locations of the text patterns in the document comprises a module for inserting annotation 
O 3 tags in front of and after each text pattern identified in the document to be relevant to a 

4 concept from the plurality of concepts) wherein the annotation tags for each text pattern 

U 5 identify the concept to which the text pattern is relevant. 

sTi I 

ST - I 
fl I 

Hi 1 26. The system of clapn 25 wherein the module for displaying the 

^ 2 document comprises: 

O 3 for each text pattern identified as being relevant to a concept from the 

2 4 plurality of concepts: 

y3 5 a module for determining the concept to which the text pattern is 

p 6 relevant from the annotation tags surroimdjing the text pattern; 

7 a module for determining style information to be used for 



8 annotating the text pattern, wherein the stylL information is associated with the concept to 

9 which the text pattern is relevant; and \ 

10 a module for annotating the text pattern based on the style 

1 1 information. 

1 27. The system of claim 26 wherein the module for annotating the text 

2 pattern comprises a module for highlighting the text pattern in a color indicated by the 

3 style information. 
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1 28. The system of claim 2<0 wherein the plurality of modules stored in 

i 

2 the memory further comprise: 



3 a module for calculating a score for each concept in the plurality of 

4 concepts, the score indicating relevance of the document to the concept; and 

5 a module for displaying a relevance indicator for each concept in the 

6 plurality of concepts based on the score for the concept. 

1 29. The system of claim 28 |vherein the module for calculating the 

2 score for each concept in the plurality of concepts comprises: 

3 for each concept in the plurality! of concepts: 

i 

4 a module for determining frequency of text patterns in the 

5 document identified as being relevant to the concept; and 
^ 6 a module for calculating the score based on the frequency of the 
ff 1 7 text patterns. 

g 1 30. A system for displaying a multi-page document using a browser, 

~£ 2 the system comprising: 

y3 

s 3 a processor; and 

S 4 a memory coupled to the processor and configured to store a plurality of 

^ 5 modules for execution by the processor, the pluiality of modules module including: 
O 6 a module for accessing a multi-page document; 

- 7 a module for displaying a jsection of the multi-page document in a 

8 first viewing area of a display; 

9 a module for determining information about the contents of the 

10 multi-page document; 

11 a module for displaying a thUmbnail image in a second viewing 

12 area of the display based on the contents information determined from the multi-page 

13 document, the thumbnail image displaying the contents of the multi-page document in a 

14 continuous form; and 

15 a module for emphasizing an afrea of the thumbnail image 

16 corresponding to the section of the multi-page document displayed in the first viewing 

17 area. 
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3 1 . The system of claim 30 wherein: 

the module for determining information about the contents of the multi- 
page document comprises: 

a module for determining dimension information for the contents; 
and | 

a module for determining coordinate information for the contents; 
and | 

the module for displaying the thumbnail image comprises: 

a module for displaying the qontents in the thumbnail image based 
on the dimension and coordinate information for the contents. 



32. The system of claim 3 1 wherein the module for displaying the 
contents in the thumbnail image based on the dimension and coordinate information for 
the contents comprises: 

for each content: 

a module for determining position of the content in the thumbnail 
image by dividing the coordinate and dimension information for the content by a 
reduction ratio. 

33. The system of claim 30 wherein the module for determining 
information about the contents of the multi-page document comprises: 

a module for determining information about text entities contained in the 
multi-page document; 

a module for determining dimension and coordinate information for the 

text entities; 

a module for determining if the text entities are relevant to a plurality of 
concepts; and \ 

if a text entity is relevant to a concept in the plurality of concepts: 

a module for associating the text entity! with style information for 
the concept, the style information indicating a manner of annotating text entities in the 
multi-page document which are relevant to the concept. 
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34. The System of claim 33 wherein the module for displaying the 
thumbnail image comprises: 

if a text entity is^relevant to a concept in the plurality of concepts: 

a module for displaying the text entity in the thumbnail image 
\ 

using the style information for the concept. 

35. The system of claim 34 wherein the plurality of modules stored in 
the memory further comprise: \ 

a module for modifying the style information for a concept; 



i 1 



in response to the modification: 

a module for identifying text entities in the multi-page document 
which are relevant to the concept; and 

a module for dynamically changing the display of the identified 
text entities in the thumbnail image based\on the modified style information. 

36. The system of claim BO wherein the module for determining 
information about the contents of the multi-page document comprises: 

a module for determining infoppiation about forms contained in the multi- 
page document; and 

a module for determining dimen^on and coordinate information for the 

forms. 

37. The system of claim 30 wherein the module for determining 
information about the contents of the multi-page document comprises: 

a module for determining information\about image elements contained in 
the multi-page document; and 

a module for determining dimension ani( coordinate information for the 
image elements. 

38. A system for displaying a multi-pa^e document using a browser, 
the system comprising: 

a processor; and 

a memory coupled to the processor and configured to store a plurality of 
modules for execution by the processor, the plurality of modules module including: 

a module for accessing the multi-page document; 
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a module for searching the multi-pag^document to identify text 
patterns in the document which are relevant to a plurality of concepts; 

a module for marking locations of the text patterns in the 
document; j 

a module for displaying a section^of the multi-page document in a 
first viewing area of a display such that the text patterns in the multi-page document 
which are relevant to a plurality of concepts are annotated; 

a module for determining information about the contents of the 
multi-page document; j 

a module for displaying a thumbnail image in a second viewing 
area of the display based on the contents information determined from the multi-page 
document, the thumbnail image displaying the contents of the multi-page document in a 
continuous form; and / 

a module for emphasizing/an area of the thumbnail image 
corresponding to the section of the multi-page document displayed in the first viewing 
area. 

39. A computer program product stored on a computer readable 
storage medium for displaying a document usipg a browser, the computer program 
product comprising: 

code for receiving accessing the document; 

code for searching the document to identify text patterns in the document 
which are relevant to a plurality of concepts;/ 

code for marking locations ofithe text patterns in the document; and 

code for displaying the document using the browser such that the text 



patterns in the document which are relevant 



storage medium for displaying a multi-pag* 
program product comprising: 



to the plurality of concepts are annotated. 



40. A computer program product stored on a computer readable 



document using a browser, the computer 



code for accessing a multi-page document; 
code for displaying a section of the multi-page document in a first viewing 
area of a display; 
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7 code for determining information about the contents of the multi-page 

8 document; ] 

9 code for displaying a thumbnail image in a second viewing area of the 

1 0 display based on the contents information determined from the multi-page document, the 

1 1 thumbnail image displaying the contents of the multi-page document in a continuous 

12 form; and ^ 

13 code for emphasizing an area of the thumbnail image corresponding to the 

14 section of the multi-page document displayed in the first viewing area. 



41 . A computer program product stored on a computer readable 
storage medium for displaying a multi-page document/using a browser, the computer 
program product comprising: / 

code for accessing the multi-page document; 

code for searching the multi-page docimient to identify text patterns in the 
document which are relevant to a plurality of concepts* 

code for marking locations of the text patterns in the document; 
8 code for displaying a section of the mulm-page document in a first viewing 

l of a display such that the text patterns in the multi-page document which are relevant 
plurality of concepts are annotated; I 

code for determining information about trie contents of the multi-page 



area 
10 to a 



11 

12 document; 
13 



, i 

code for displaying a thumbnail image ink second viewing area of the 
display based on the contents information determined from the multi-page document, the 
thumbnail image displaying the contents of the multi-page document in a continuous 
form; and \ 

code for emphasizing an area of the thumbnail image corresponding to the 
nulti-page document displayed in the first viewing area. 



1 8 section of the multi-page 
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